RESAMPLING STUDENT'S t-TYPE STATISTICS
نویسنده
چکیده
The present paper establishes conditional and unconditional central limit theorems for various resampling procedures for the t-statistic. The results work under fairly general conditions and the underlying random variables need not to be independent. Specific examples are then the 're(n) (double) bootstrap out of k(n) observations, the Bayesian bootstrap and two-sample t-type permutation statistics. In case when m(n)/k(n) is bounded away from zero and infinity necessary and sufficient conditions for the conditional central limit law of the bootstrap t-statistics are established. For high resampling intensity when m(n)/k(n) tends to infinity the following general result is obtained. Without further other assumptions the bootstrap makes the resampled t-statistic automatically normal. The results are based on a general conditional limit theorem for weighted resampling statistics which is of own interest. Consider a triangular array of arbitrary real random variables Xn,i, 1 < i < k(n), on some probability space (gt, ,4, P) with k(n)-+ ~x~ as n-~ c~. The one-sample t-statistic is then (1.1) t~= k(~-l/2 ~-~k(n) Xn,i \'~1 A.~i-1 1 ~-]ki:(~)(Xni_-Rn)2) k(n)-1 = ' i/2 given by the mean X~ = 1/k(n) V'k(n) Xn ~. Throughout we will discuss the limit behaviour of various resampling versions of tn and tests of t-test type. Specific examples are all kind of bootstrap and permutation resampling statistics. The results are typically applied in testing statistical hypotheses when critical values of tn are derived by resampling methods under a nonparametric null hypothesis, see Section 5 and Janssen and Pauls (2003). At this stage we like to emphasize that the Xn# may come from arbitrary alternatives which may no longer be independent or identically distributed. In a forthcoming paper the present results will be applied to establish power functions for resampling tests. Two-sample t-statistics are treated similarly in Section 4. All proofs can be found in Section 6. In the next step let us draw re(n) resampling variables X~', , Xm n given the 9 .. () original data Xn#. Below various different resampling schemes are discussed in detail.
منابع مشابه
A Gentle Introduction to Resampling Techniques
2 Overview of resampling 2 Permutation Methods 3 Bootstrapping 3 Monte Carlo 4 Failure of t-test: Girl Scout cookie sales 4 Demonstration of permutations 5 Demonstration of Howell‟s resampling programs 5 Howell‟s example: Hurry up, Fella! 5 parametric t 6 Permutation tests (also called Randomization tests) 6 t as a measure of an effect 6 Howell‟s Resampling program 7 Howell‟s Randomization (per...
متن کاملInferences About the Skipped Correlation Coefficient: Dealing with Heteroscedasticity and Non-Normality
A common goal is testing the hypothesis that Pearson's correlation is zero and typically this is done based on Student's T test. There are, however, several well-known concerns. First, Student's T is sensitive to heteroscedasticity. That is, when it rejects, it is reasonable to conclude that there is dependence, but in terms of making a decision about the strength of the association, it is unsa...
متن کاملNon-Parametric Standard Errors and Tests for Network Statistics
Two procedures are proposed for calculating standard errors for network statistics. Both are based on resampling of vertices: the first follows the bootstrap approach, the second the jackknife approach. In addition, we demonstrate how to use these estimated standard errors to compare statistics using an approximate t-test and how statistics can also be compared by another bootstrap approach tha...
متن کاملSmall Sample Properties of Nonparametric Bootstrap t Confidence Intervals
Confidence interval construction for central tendency is a problem of practical consequence for those who must analyze air contaminant data. Determination of compliance with relevant ambient air quality criteria and assessment of associated health risks depend upon quantifying the uncertainty of estimated mean pollutant concentrations. The bootstrap is a resampling technique that has been stead...
متن کاملEmpirical pathway analysis, without permutation
Resampling-based expression pathway analysis techniques have been shown to preserve type I error rates, in contrast to simple gene-list approaches that implicitly assume the independence of genes in ranked lists. However, resampling is intensive in computation time and memory requirements. We describe accurate analytic approximations to permutations of score statistics, including novel approach...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006